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Cadherin EGF LAG seven-pass G-type receptor 

2 (NP_001399) 



Accepted Sequence 

MRSPATGVPL PTPPPPLLLL LLLLLPPPLL GDQVGPCRSL GSRGRGSSGA CAPMGWLCPS 60 

SASNLWLYTS RCRDAGTELT GHLVPHHDGL RVWCPESEAH IPLPPAPEGC PWSCRLLGIG 12 0 

GHLSPQGKLT LPEEHPCLKA PRLRCQSCKL AQAPGLRAGE RSPEESLGGR RKRNVNTAPQ 180 

FQPPSYQATV PENQPAGTPV ASLRAIDPDE GEAGRLEYTM DALFDSRSNQ FFSLDPVTGA 240 

VTTAEELDRE TKSTHVFRVT AQDHGMPRRS ALATLTILVT DTNDHDPVFE QQEYKESLRE 3 00 

NLEVGYEVLT VRATDGDAPP NANILYRLLE GSGGSPSEVF EIDPRSGVIR TRGPVDREEV 3 60 

ESYQLTVEAS DQGRDPGPRS TTAAVFLSVE DDNDNAPQFS EKRYWQVRE DVTPGAPVLR 42 0 

VTASDRDKGS NAWHYSIMS GNARGQFYLD AQTGALDWS PLDYETTKEY TLRVRAQDGG 480 

RPPLSNVSGL VTVQVLDIND NAPIFVSTPF QATVLESVPL GYLVLHVQAI DADAGDNARL 54 0 

EYRLAGVGHD FPFTINNGTG WISVAAELDR EEVDFYSFGV EARDHGTPAL TASASVSVTV 60 0 

LDVNDNNPTF TQPEYTVRLN EDAAVGTSW TVSAVDRDAH SVITYQITSG NTRNRFSITS 660 

QSGGGLVSLA LPLDYKLERQ YVLAVTASDG TRQDTAQIW NVTDANTHRP VFQSSHYTVN 72 0 

VNEDRPAGTT WLISATDED TGENARITYF MEDSIPQFRI DADTGAVTTQ AELDYEDQVS 780 

YTLAITARDN GIPQKSDTTY LEILVNDVND NAPQFLRDSY QGSVYEDVPP FTSVLQISAT 84 0 

DRDSGLNGRV FYTFQGGDDG DGDFIVESTS GIVRTLRRLD RENVAQYVLR AYAVDKGMPP 900 

ARTPMEVTVT VLDVNDNPPV FEQDEFDVFV EENSPIGLAV ARVTATDPDE GTNAQIMYQI 960 

VEGNIPEVFQ LDIFSGELTA LVDLDYEDRP EYVLVIQATS APLVSRATVH VRLLDRNDNP 102 0 

PVLGNFEILF NNYVTNRSSS FPGGAIGRVP AHDPDISDSL TYSFERGNEL SLVLLNASTG 1080 

ELKLSRALDN NRPLEAIMSV LVSDGVHSVT AQCALRVTII TDEMLTHSIT LRLEDMSPER 114 0 

FLSPLLGLFI QAVAATLATP PDHVWFNVQ RDTDAPGGHI LNVSLSVGQP PGPGGGPPFL 12 00 

PSEDLQERLY LNRSLLTAIS AQRVLPFDDN ICLREPCENY MRCVSVLRFD SSAPFIASSS 1260 

VLFRPIHPVG GLRCRCPPGF TGDYCETEVD LCYSRPCGPH GRCRSREGGY TCLCRDGYTG 132 0 

EHCEVSARSG RCTPGVCKNG GTCVNLLVGG FKCDCPSGDF EKPYCQVTTR SFPAHSFITF 1380 

RGLRQRFHFT LALSFATKER DGLLLYNGRF NEKHDFVALE VIQEQVQLTF SAGESTTTVS 1440 

PFVPGGVS0G QWHTVQLKYY NKPLLGQTGL PQGPSEQKVA WTVDGCDTG VALRFGSVLG 1500 

NYSCAAQGTQ GGSKKSLDLT GPLLLGGVPD LPESFPVRMR QFVGCMRNLQ VDSRHIDMAD 1560 

FIANNGTVPG CPAKKNVCDS NTCHNGGTCV NQWDAFSCEC PLGFGGKSCA QEMANPQHFL 162 0 

GSSLVAWHGL SLPISQPWYL SLMFRTRQAD GVLLQAITRG RSTITLQLRE GHVMLSVEGT 168 0 

GLQASSXiRLE PGRANDGDWH HAQLALGASG GPGHAILSFD YGQQRAEGNL GPRLHGLHLS 174 0 

NITVGGIPGP AGGVARGFRG CLQGVRVSDT PEGVNSLDPS HGESINVEQG CSLPDPCDSN 1800 

PCPANSYCSN DWDSYSCSCD PGYYGDNCTN VCDLNPCEHQ SVCTRKPSAP HGYTCECPPN 1860 

YLGPYCETRI DQPCPRGWWG HPTCGPCNCD VSKGFDPDCN KTSGECHCKE NHYRPPGSPT 1920 

CLLCDCYPTG SLSRVCDPED GQCPCKPGVI GRQCDRCDNP FAEA/TTNGCE VNYDSCPRAI 198 0 

EAGIWWPRTR FGLPAAAPCP KGSFGTAVRH CDEHRGWLPP NLFNCTSITF SELKGFAERL 2040 

QRNESGLDSG RSQQLALLLR NATQHTAGYF GSDVKVAYQL ATRLLAHEST QRGFGLSATQ 2100 

DVHFTENIiliR VGSALLDTAN KRHWELIQQT EGGTAWLLQH YEAYASALAQ NMRHTYLSPF 2160 

TIVTPNIVIS WRLDKGNFA GAKLPRYEAL RGEQPPDL.ET TVILPESVFR ETPPWRPAG 2220 
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PGEAQEPEEL ARRQRRHPEL SQGEAVASVI lYRTLAGLLP HNYDPDKRSL RVPKRPIINT 22 80 

PWSISVHDD EELLPRALDK PVTVQFRLLE TEERTKPICV FWNHSILVSG TGGWSARGCE 2340 

WFRNESHVS CQCNHMTSFA VLMDVSRREN GEILPLKTLT YVALGVTLAA LLLTFFFLTL 24 00 

LRILRSNQHG IRRNLTAALG loAQLVFLLGI NQADLPFACT VIAILLHFLY LCTFSWALLE 2460 

ALHLYRALTE VRDVNTGPMR FYYMLGWGVP AFITGLAVGL DPEGYGNPDF CWLSIYDTLI 2520 

WSFAGPVAFA VSMSVFLYIL AARASCAAQR QGFEKKGPVS GLQPSFAVLL LLSATWLLAL 25 80 

LSVNSDTLLF HYLFATCNCI QGPFIFLSYV VLSKEVRKAL KLACSRKPSP DPALTTKSTL 264 0 

TSSYNCPSPY ADGRLYQPYG DSAGSLHSTS RSGKSQPSYI PFLLREESAL NPGQGPPGLG 2700 

DPGSLFLEGQ DQQHDPDTDS DSDLSLEDDQ SGSYASTHSS DSEEEEEEEE EEAAFPGEQG 2760 

WDSLLGPGAE rYpLHSTPKD' GGPGPGKAPW PGDFGTTAKE * SSGNG^^ LRENGDALSR 282"0 

EGSLGPLPGS SAQPHKGILK KKCLPTISEK SSLLRLPLEQ CTGSSRGSSA SEGSRGGPPP 28 80 

RPPPRQSLQE QLNGVMPIAM SIKAGTVDED SSGSEFLFFN FLH 292 3 



Molecular Weight = 317469.84 



Chris Putna m 
cdputnam@scripps. edu 
Graduate Student 
The Scripps Research Institute 



Last Updated: May 26, 1999 



http://www.scripps.edu/cgi-bin/cdputnam/protcalc 



6/7/04 



NCBI Sequence Viewer 



BEST AVAILABLE COPY 



Page 1 of 4 



NCBI 



Entrez PubMed 

Search! Protein 



default 




Nucleotide 



Protein 



fori 



Genome 



Structure 



Limits 



Preview/Index 



Show: 



11: NP_001399 . cadherin EGF LAG ,..[gi:13325064] 



History 



PMC 



Taxonomy 



Clipboard 



^BiQe||Sj|b[se^u 



Boo 



Details 

i 



jgeat 



BLink, Domains, Links 



LOCUS 

DEFINITION 



ACCESSION 

VERSION 

DBSOURCE 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 



JOURNAL 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
PUBMED 
REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
PUBMED 
COMMENT 



NP_001399 2923 aa linear PRI 20-DEC-2003 

cadherin EGF LAG seven-pass G-type receptor 2; EGF-like-domain, 
multiple 2; epidermal growth factor- like 2; multiple epidermal 
growth factor- like domains 3 [Homo sapiens] . 
NP_001399 

NP_001399.1 GI: 13325064 
REFSEQ: accession NM 001408.1 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (residues 1 to 2923) 

Vincent , J.B . , Skaug,J. and Scherer,S.W. 

The human homologue of flamingo, EGFL2, encodes a brain-expressed 
large cadherin-like protein with epidermal growth factor-like 
domains, and maps to chromosome Ipl3.3-p21.1 
DNA Res. 7 (3), 233-235 (2000) 
10907856 

2 (residues 1 to 2923) 
Wu,Q. and Maniatis,T. 

Large exons encoding multiple ectodomains are a characteristic 
feature of protocadherin genes 

Proc. Natl. Acad. Sci. U.S.A. 97 (7), 3124-3129 (2000) 
10716726 

3 (residues 1 to 2923) 
Wu,Q. and Maniatis,T. 

A striking organization of a large family of human neural 
cadherin-like cell adhesion genes 
Cell 97 (6), 779-790 (1999) 
10380929 



1 to 2923) 

Naka j ima , D . , Nagase , T . 



Nomura, N., Seki,N. and 



4 (residues 
Nakayama , M . , 
Ohara, O . 

Identification of high-molecular-weight proteins with multiple 
EGF-like motifs by motif-trap screening 
Genomics 51 (1), 27-34 (1998) 
96930 30 

REVIEWED REFS EQ : This record has been curated by NCBI staff. The 
reference sequence was derived from AF2 348 87 . 1 . 



Summary: The protein encoded by this gene is a member of the 
flamingo subfamily, part of the cadherin superfamily. The flamingo 
subfamily consists of none las sic -type cadherins; a subpopulation 
that does not interact with catenins . The flamingo cadherins are 
located at the plasma membrane and have nine cadherin domains, 
seven epidermal growth factor-like repeats and two laminin A G-type 
repeats in their ectodomain. They also have seven transmembrane 
domains, a characteristic unique to this subfamily. It is 
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postulated that these proteins are receptors involved in 
contact -mediated communication, with cadherin domains acting as 
hemophilic binding regions and the EGF-like domains involved in 
cell adhesion and receptor-ligand interactions. The specific 
function of this particular member has not been determined. 
FEATURES Location/Qualifiers 
source 1 . .2923 

/organism="Homo sapiens" 
/ db_xr e f = " t axon : 9 6 0 6 " 
/chromosome= " 1 " 
/map="lp21" 
Protein 1 . .2923 

/product= "cadherin EGF LAG seven-pass G-type receptor 2" 
/note=" EGF-like -domain, multiple 2; epidermal growth 
factor-like 2; multiple epidermal growth factor-like 
domains 3 " 
Region 31 . .2767 

/region_name=" Cadherin EGF LAG seven-pass G-type receptor 
[Signal transduction mechanisms] " 
/note="KOG4289" 
/ db_xr e f = " CDD : 22065 " 
Region 185 . .391 

/region_name= "Cadherin repeat domain" 
/note="CA" 

/db_xref = "CDD : 14 772 " 
Region 293 . .497 

/region_name=" Cadherin repeat domain" 
/note="CA" 

/ db_xr e f = " CDD : 14772 " 
E^SiPI}: 404 . . 602 

/region_name=" Cadherin repeat domain" 
/note="CA" 

/ db_xr e f = " CDD : 14772 " 
Region 509.. 704 

/region_name=" Cadherin repeat domain" 
/note="CA" 

/ db_xr e f = " CDD : 14772 " 
Region 614.. 807 

/region_name= "Cadherin repeat domain" 
/note="CA" 

/ db_xr e f = " CDD : 14772 " 
Region 716 . . 913 

/ r eg ion_name= "Cadherin repeat domain" 
/note="CA" 

/ db_xr e f = " CDD : 14772 " 
Region 819.. 1015 

/region_name= "Cadherin repeat domain" 
/note="CA" 

/ db_xr e f = " CDD : 14772 " 
Re gion 933.. 1119 

/region_name= "Cadherin repeat domain" 
/note="CA" 

/ db_xr e f = " CDD : 14772 " 
Region <1264..>1367 

/region_name= "Fibrillins and related proteins containing 

Ca2+-binding EGF-like domains [Signal transduction 

mechanisms] " 

/note="KOG1217" 

/ db_xr e f = " CDD : 19006 " 
Region 1369.. 1552 
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Region 



Re gion 



Region 



Region 



CDS 



/region_name="Laminin G domain" 

/note="LamG" 

/ db_xr e f = " CDD : 5359 " 

1576. .1609 

/region_name= "Calcium-binding EGF-like domain, present in 

a large number of membrane -bound and extracellular (mostly 

animal) proteins" 

/note="EGF_CA" 

/ db_xr e f = " CDD : 14791 " 

1618. .1767 

/region_name="Laminin G domain" 

/note="LamG" 

/ db_xr e f = " CDD : 5359 " 

<1791. .1982 

/region_name="Fibrillins and related proteins containing 

Ca2+-binding EGF-like domains [Signal transduction 

mechanisms] " 

/note="K0G1217" 

/ db_xr e f = " CDD : 19006 " 

1972.. 2034 

/ r eg ion_name= "Domain present in hormone receptors" 
/note="HormR" 
/db_xref = "CDD : 22660 " 
2315 . .2368 

/region_name="G-protein-coupled receptor proteolytic site 

domain" 

/note="GPS" 

/ db_xr e f = " CDD : 22750 " 

2402. .2616 

/region_name="7 transmembrane receptor (Secretin family) " 

/note="7tm_2 " 

/ db_xr e f = " CDD : 571 " 

1..2923 

/gene="CELSR2" 

/coded_by="NM_001408 .1:63.. 8834" 

/note="go_component : integral to membrane [gold 0016021 ] 
[evidence NAS] [pmid 10907856] ; 

go_f unction: G-protein coupled receptor activity [gold 
0004930 ] [evidence NAS] [pmid 10907856] / 

go_f unction: structural molecule activity [goid 000519 8] 
[evidence lEA] ; 

go_f unction: calcium ion binding [goid 0005509] [evidence 
lEA] / 

go_process: neuropeptide signaling pathway [goid 0007218 ] 
[evidence lEA] ; 

go_j)rocess: hemophilic cell adhesion [goid 0007156 ] 
[evidence lEA] ; 

goj)rocess; development [goid 0007275 ] [evidence lEA] " 
/ db_xr e f = " Gene ID : 1952 " 
/db_xref = " Locus ID: 19^52 " 
/db xref="MIM: 604265 " 



ORIGIN 



1 mrspatgvpl 
61 sasnlwlyts 
121 ghlspqgklt 
181 fqppsyqatv 
241 vttaeeldre 
301 nlevgyevlt 
361 esyqltveas 
421 vtasdrdkgs 



ptppppllll 
rcrdagtelt 
Ipeehpclka 
penqpagtpv 
tksthvf rvt 
vratdgdapp 
dqgrdpgprs 
nawhysims 



lllllpppll 
ghlvphhdgl 
prlrcqsckl 
aslraidpde 
aqdhgmprrs 
nanilyrlle 
ttaavf Isve 
gnargqfyld 



gdqvgpcrsl 
rvwcpeseah 
aqapglrage 
geagrleytm 
alatltilvt 
gsggspsevf 
ddndnapqf s 
aqtgaldws 



gsrgrgssga 
iplppapegc 
rspeeslggr 
dalf dsrsnq 
dtndhdpvf e 
eidprsgvir 
ekrywqvre 
pldyettkey 



capmgwlcps 
pwscrllgig 
rkrnvntapq 
f f sldpvtga 
qqeykeslre 
trgpvdreev 
dvtpgapvlr 
tlrvraqdgg 
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481 rpplsnvsgl vtvcjvldind napifvstpf qatvlesvpl gylvlhvqai dadagdnarl 
541 eyrlagvghd fpftinngtg wisvaaeldr eevdfysfgv eardhgtpal tasasvsvtv 
601 Idvndnnptf tqpeytvrln edaavgtsw tvsavdrdah svityqitsg ntrnrfsits 
661 qsggglvsla Ipldyklerq yvlavtasdg trqdtaqiw nvtdanthrp vfqsshytvn 
721 vnedrpagtt wlisatded tgenarityf tnedsipqfri dadtgavttq aeldyedqvs 
781 ytlaitardn gipqksdtty leilvndvnd napqflrdsy qgsvyedvpp ftsvlqisat 
841 drdsglngrv fytfqggddg dgdfivests givrtlrrld renvaqyvlr ayavdkgmpp 
901 artpmevtvt vldvndnppv feqdefdvfv eenspiglav arvtatdpde gtnaqimyqi 
961 vegnipevfq Idifsgelta Ivdldyedrp eyvlviqats aplvsratvh vrlldrndnp 
1021 pvlgnfeilf nnyvtnrsss fpggaigrvp ahdpdisdsl tysfergnel slvllnastg 
1081 elklsraldn nrpleaimsv Ivsdgvhsvt aqcalrvtii tdemlthsit Irledmsper 
1141 flspllglfi qavaatlatp pdhvwfnvq rdtdapgghi Invslsvgqp pgpgggppfl 
1201 psedlqerly Inrslltais aqrvlpfddn iclrepceny mrcvsvlrfd ssapfiasss 
1261 vlfrpihpvg glrcrcppgf tgdycetevd Icysrpcgph grcrsreggy tclcrdgytg 
1321 ehcevsarsg rctpgvckng gtcvnllvgg fkcdcpsgdf ekpycqvttr sfpahsfitf 
1381 rglrqrfhft lalsfatker dglllyngrf nekhdfvale viqeqvqltf sagestttvs 
1441 pfvpggvsdg qwhtvqlkyy nkpllgqtgl pqgpseqkva wtvdgcdtg valrfgsvlg 
1501 nyscaaqgtq ggskksldlt gplllggvpd Ipesfpvrmr qfvgcmrnlq vdsrhidmad 
1561 fianngtvpg cpakknvcds ntchnggtcv nqwdafscec plgfggksca qemanpqhfl 
1621 gsslvawhgl slpisqpwyl slmfrtrqad gvllqaitrg rstitlqlre ghvmlsvegt 
1681 glqasslrle pgrandgdwh haqlalgasg gpghailsfd ygqqraegnl gprlhglhls 
1741 nitvggipgp aggvargfrg clqgvrvsdt pegvnsldps hgesinveqg cslpdpcdsn 
1801 pcpansycsn dwdsyscscd pgyygdnctn vcdlnpcehq svctrkpsap hgytcecppn 
1861 ylgpycetri dqpcprgwwg hptcgpcncd vskgfdpdcn ktsgechcke nhyrppgspt 
1921 cllcdcyptg slsrvcdped gqcpckpgvi grqcdrcdnp faevttngce vnydscprai 
1981 eagiwwprtr fglpaaapcp kgsfgtavrh cdehrgwlpp nlfnctsitf selkgfaerl 
2041 qrnesgldsg rsqqlalllr natqhtagyf gsdvkvayql atrllahest qrgfglsatq 
2101 dvhftenllr vgsalldtan krhweliqqt eggtawllqh yeayasalaq nmrhtylspf 
2161 tivtpnivis wrldkgnfa gaklpryeal rgeqppdlet tvilpesvfr etppwrpag 
2221 pgeaqepeel arrqrrhpel sqgeavasvi iyrtlagllp hnydpdkrsl rvpkrpiint 
2281 pwsisvhdd eellpraldk pvtvqfrlle teertkpicv fwnhsilvsg tggwsargce 
2341 wfrneshvs cqcnhmtsfa vlmdvsrren geilplktlt yvalgvtlaa llltfffltl 
2401 Irilrsnqhg irrnltaalg laqlvfllgi nqadlpfact viaillhfly Ictfswalle 
2461 alhlyralte vrdvntgpmr fyymlgwgvp afitglavgl dpegygnpdf cwlsiydtli 
2521 wsfagpvafa vsmsvflyil aarascaaqr qgfekkgpvs glqpsfavll llsatwllal 
2581 Isvnsdtllf hylfatcnci qgpfiflsyv vlskevrkal klacsrkpsp dpalttkstl 
2 641 tssyncpspy adgrlyqpyg dsagslhsts rsgkscjpsyi pfllreesal npgqgppglg 
2 701 dpgslflegq dqqhdpdtds dsdlsleddq sgsyasthss dseeeeeeee eeaafpgeqg 
2761 wdsllgpgae rlplhstpkd ggpgpgkapw pgdfgttake ssgngapeer Irengdalsr 
2821 egslgplpgs saqphkgilk kkclptisek ssllrlpleq ctgssrgssa segsrggppp 
2881 rppprqslqe qlngvmpiam sikagtvded ssgseflffn flh 
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